PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa01g002940.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 702aa    MW: 78421.5 Da    PI: 5.4657
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa01g002940.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox57.62.2e-182579256
                    T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox  2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                    r+ +++t++q++ Le++F ++++p++++r++L ++l+L  +qVk+WFqN+R++ k
  Csa01g002940.1 25 RTGHRHTPQQIQGLEAFFMECPHPDEAQRQQLCEELKLGLNQVKFWFQNKRTQCK 79
                    555789*********************************************9987 PP

2START115.86.4e-372184374206
                     HHHHHHHHHHHHHC-TT-EEEE.......EXCCTTEEEEEEESSS...SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEE CS
           START   4 eeaaqelvkkalaeepgWvkss.......esengdevlqkfeeskv..dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevi 85 
                      +a +el ++   ee++Wvkss       +sen+++    +++ ++   ++e +++++vv+ ++++l e +ld   +W+e ++    +a+tl v+
  Csa01g002940.1 218 ASAVEELKRLFFTEEQFWVKSSidgtdviDSENYEKFSNAVKKFRSmsAHVESSKDVTVVPIEATNLIEMFLDAE-KWKELFPtmvnQAKTLHVL 311
                     66778888888899***********999999999998777755444778************************99.99998888888******** PP

                     CTT.......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE- CS
           START  86 ssg.......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdl 172
                      s        ++l +m  +l  lsplvp R+f++vR++++ ++g w+i+dvS ++  +   ++s+    ++pSg+li++++n  skv w+ehv++
  Csa01g002940.1 312 GSElpirencNILRVMWEQLHILSPLVPpREFMIVRCCQEISKGLWIIADVSHNVYFDFV-NASC---YKRPSGCLIQSLPNAQSKVMWIEHVEV 402
                     ****************************************************99998887.4555...559************************ PP

                     -SSXX.HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 173 kgrlp.hwllrslvksglaegaktwvatlqrqcek 206
                     + +l  h +++ l++ g   gak+w atl+r ce+
  Csa01g002940.1 403 GHKLDtHKIFKELLSGGSGYGAKRWIATLERMCER 437
                     ****99***************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.602.3E-18779IPR009057Homeodomain-like
SuperFamilySSF466892.59E-17981IPR009057Homeodomain-like
PROSITE profilePS5007115.2562181IPR001356Homeobox domain
SMARTSM003891.8E-172385IPR001356Homeobox domain
CDDcd000861.13E-162482No hitNo description
PfamPF000466.8E-162679IPR001356Homeobox domain
PROSITE profilePS5084839.844206440IPR002913START domain
SuperFamilySSF559617.97E-26208438No hitNo description
CDDcd088751.58E-85211436No hitNo description
SMARTSM002341.2E-19215437IPR002913START domain
PfamPF018522.3E-30218437IPR002913START domain
Gene3DG3DSA:3.30.530.201.8E-6243404IPR023393START-like domain
SuperFamilySSF559613.85E-5458665No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 702 aa     Download sequence    Send to blast
MENYGSGSSC NEQYASEDSK QSGKRTGHRH TPQQIQGLEA FFMECPHPDE AQRQQLCEEL  60
KLGLNQVKFW FQNKRTQCKV QEEKAGNLSL RGQNEILKSE NEAMHEALSS VLCPDCGGPP  120
FGREERALNF QKLLLENARL KEQRDKISHF LSNMSKRLTV GGSLASVPTQ HIFDQISSYG  180
INPSTMFDPS SSFGPPTSQP IQPQLFQMDV SLLSETAASA VEELKRLFFT EEQFWVKSSI  240
DGTDVIDSEN YEKFSNAVKK FRSMSAHVES SKDVTVVPIE ATNLIEMFLD AEKWKELFPT  300
MVNQAKTLHV LGSELPIREN CNILRVMWEQ LHILSPLVPP REFMIVRCCQ EISKGLWIIA  360
DVSHNVYFDF VNASCYKRPS GCLIQSLPNA QSKVMWIEHV EVGHKLDTHK IFKELLSGGS  420
GYGAKRWIAT LERMCERMAL TSNLTLPASD WSEVIRTGEE RRRVLKLGER MIMNFNEMLT  480
MSGKVDFPQQ SKCGVRVSMR INLEAGQPRG LIVSAASSFP IPLPPVQVFD NLRKLDPRQQ  540
WDVLAYGTVV TEIARVATGS SETNCLSILR PTQEENNGKL VAEDSDKGDM LMLQDCYMDA  600
LGGMLVYAPM DMTTMDTTLT GADVEISDIP ILPSGFIISS DGRRSTVEDG GTLLTLAFQI  660
LVSGNTNRAR DVNENSVNTV STLISSTVQR IKGLLNCPDQ C*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010507998.10.0PREDICTED: homeobox-leucine zipper protein HDG8 isoform X1
SwissprotQ9M9P40.0HDG8_ARATH; Homeobox-leucine zipper protein HDG8
TrEMBLR0HRS60.0R0HRS6_9BRAS; Uncharacterized protein
STRINGAT3G03260.10.0(Arabidopsis thaliana)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM127911530
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G03260.10.0homeodomain GLABROUS 8